This directory contains 4 .do files:
-TeacherDesegClean.do generates the working data set of district-level data used for most of the paper
-TeacherDesegAnalysis.do generates the table and figures using district-level data in both the main paper and the appendix
 
-CensusMain.do creates the data set for the main Census analysis and generates the corresponding tables
-CensusFiveYearAgoOcc.do creates the needed data set and generates the corresponding tables for the models using the Census five-year occupation recalls
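
A possible run order for the four programs is sketched below. This is only an illustration: it assumes Stata's working directory is this directory and that the .do files read and write their files relative to it, which the programs themselves may not do. The district-level cleaning program is run before the district-level analysis program because the latter uses the data set the former generates.

```stata
* Sketch only: assumes the working directory is this directory
* (set it with "cd" beforehand if needed).

* District-level data: build the working data set, then run the analysis
do TeacherDesegClean.do
do TeacherDesegAnalysis.do

* Census: each program creates its own data set and tables
do CensusMain.do
do CensusFiveYearAgoOcc.do
```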

This directory also contains several raw data sets that are inputs for the programs described above, and 4 clean data sets used for the analysis. The 4 clean data sets can also be generated by the user from the raw data and the provided programs, but are provided in final form for convenience.
These data sets are:


District Level:
-STATE_HandEnter.xlsx 			Hand-entered data through 1964 for the indicated state, entered by the author from state annual reports
-HandEntered1967.xlsx 			1967 data entered by the author from NCES (1967)
-OCR68.txt, OCR70.txt, OCR72.txt  	Data from OCR surveys via Sarah Reber for 1968, 1970, and 1972
-sersYEAR-STATE.txt			data from SERS via Sarah Reber giving student desegregation levels for the pre-1964 period
-OCR_FIPS_xwalk.dta 			a crosswalk from OCR state/county codes to FIPS codes 
-reorgdata.dta				indicators of district reorganizations based on Cascio et al. (2013) data
-teachers_8state.dta			Clean data set for most of the district-level models
-teachers_11state.dta			Clean data set for some district-level models in the appendix

Census:
-usa_00070.dat				Raw Census data for main Census analysis
-MainWorking.dta			Clean data set for estimating main Census results
-usa_00063.dat				Raw Census data for five-year occupation recalls
-OccFiveYrs.dta				Clean data set for estimating Census results using the five-year occupation question


Please direct inquiries to ot3@williams.edu